Characterizing the (Perceived) Newsworthiness of Health Science Articles: A Data-Driven Approach

نویسندگان

  • Ye Zhang
  • Erin Willis
  • Michael J Paul
  • Noémie Elhadad
  • Byron C Wallace
چکیده

BACKGROUND Health science findings are primarily disseminated through manuscript publications. Information subsidies are used to communicate newsworthy findings to journalists in an effort to earn mass media coverage and further disseminate health science research to mass audiences. Journal editors and news journalists then select which news stories receive coverage and thus public attention. OBJECTIVE This study aims to identify attributes of published health science articles that correlate with (1) journal editor issuance of press releases and (2) mainstream media coverage. METHODS We constructed four novel datasets to identify factors that correlate with press release issuance and media coverage. These corpora include thousands of published articles, subsets of which received press release or mainstream media coverage. We used statistical machine learning methods to identify correlations between words in the science abstracts and press release issuance and media coverage. Further, we used a topic modeling-based machine learning approach to uncover latent topics predictive of the perceived newsworthiness of science articles. RESULTS Both press release issuance for, and media coverage of, health science articles are predictable from corresponding journal article content. For the former task, we achieved average areas under the curve (AUCs) of 0.666 (SD 0.019) and 0.882 (SD 0.018) on two separate datasets, comprising 3024 and 10,760 articles, respectively. For the latter task, models realized mean AUCs of 0.591 (SD 0.044) and 0.783 (SD 0.022) on two datasets-in this case containing 422 and 28,910 pairs, respectively. We reported most-predictive words and topics for press release or news coverage. CONCLUSIONS We have presented a novel data-driven characterization of content that renders health science "newsworthy." The analysis provides new insights into the news coverage selection process. For example, it appears epidemiological papers concerning common behaviors (eg, alcohol consumption) tend to receive media attention.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data-Driven Approaches to Improve the Quality of Clinical Processes: A Systematic Review

Background: Considering the emergence of electronic health records and their related technologies, an increasing attention is paid to data driven approaches like machine learning, data mining, and process mining. The aim of this paper was to identify and classify these approaches to enhance the quality of clinical processes. Methods: In order to determine the knowledge related to the research ...

متن کامل

Forecasting Ozone Density in Tehran Air Using a Smart Data-Driven Approach

Introduction: As a metropolitan area in Iran, Tehran is exposed to damage from air pollution due to its large population and pollutants from various sources. Accordingly, research on damage induced by air pollution in this city seems necessary. The main purpose of this study was to forecast ozone in the city of Tehran. Considering the hazards of ozone (O3) gas on human health and the environmen...

متن کامل

The clinical relevance and newsworthiness of NIHR HTA-funded research: a cohort study

OBJECTIVE To assess the clinical relevance and newsworthiness of the UK National Institute for Health Research (NIHR) Health Technology Assessment (HTA) Programme funded reports. STUDY DESIGN Retrospective cohort study. SETTING The cohort included 311 NIHR HTA Programme funded reports publishing in HTA in the period 1 January 2007-31 December 2012. The McMaster Online Rating of Evidence (MO...

متن کامل

A Corpus-driven Food Science and Technology Academic Word List

The overarching goal of this study was to create a list of the most frequently occurring academic words in Food Science and Technology (FST). To this end, a 4,652,444-word corpus called Food Science and Technology Research Articles (FSTRA), which included 1,421 research articles (RAs) randomly selected from 38 journals across five sub-disciplines in FST, was developed. Frequency and range-based...

متن کامل

Occupational Health promotion throughout the synergy between ergonomics and sustainable development aspects

One of the main goals of all societies whether in developed or developing countries is sustainable development and quality of life improvement. Both of the mentioned fields are known as critical subjects for urban planners, health care systems authorities, organizations and industrial sectors managers. Sustainable development is a global and human-centered approach. Also, ergonomics as a multid...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2016